Phylogeny determined by protein domain content.

نویسندگان

  • Song Yang
  • Russell F Doolittle
  • Philip E Bourne
چکیده

A simple classification scheme that uses only the presence or absence of a protein domain architecture has been used to determine the phylogeny of 174 complete genomes. The method correctly divides the 174 taxa into Archaea, Bacteria, and Eukarya and satisfactorily sorts most of the major groups within these superkingdoms. The most challenging problem involved 119 Bacteria, many of which have reduced genomes. When a weighting factor was used that takes account of difference in genome size (number of considered folds), small-genome taxa were mostly grouped with their full-sized counterparts. Although not every organism appears exactly at its classical phylogenetic position in these trees, the agreement appears comparable with the efforts of others by using sophisticated sequence analysis and/or combinations of gene content and gene order. During the course of the study, it emerged that there is a core set of approximately 50 folds that is found in all 174 genomes and a single fold diagnostic of all Archaea.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Differential responses of phenolic compounds of Brassica napus under drought stress.

This work evaluated the effect of drought stress on seedling growth, protein, soluble sugars, and phenolic compounds of two cultivars of canola (RGS003 and Sarigol). Drought stress was induced with polyethylene glycol (PEG) at 0, 5, 10, and 15%. Drought stress increased root fresh weight in both cultivars and the effect of drought was more pronounced in RGS003. Shoot fresh weight reduced in Sar...

متن کامل

Genetic Analysis of Three Structural Proteins in Iranian Infectious Bronchitis Virus Isolate

Infectious bronchitis virus (IBV) is a contagious pathogen in fowl that results in economic loss in the poultry industry. In this study, the amino acids sequences of three structural proteins M, N, and S1 for five Iranian IBV isolated during 1998-2011 have been analyzed. Conserved and variable regions, hydrophobic characteristics and identity matrix were determined after alignment by Bioedit ve...

متن کامل

Variation in Oil, Protein Content and Fatty Acid Composition of Twelve Turkish Opium Poppy (Papaver somniferum L.) Lines

Opium poppy (Papaver somniferum L.) has two major products: alkaloids in the capsules and the seeds.  The seed contains oil, protein, carbohydrate, moisture and mineral matters. The seed oil is rich in unsaturated fatty acids, particularly linoleic and oleic acid. Remaining meals after oil extraction are the important source for animal diets. The United Nations recognize Turkey and India as tra...

متن کامل

Global phylogeny determined by the combination of protein domains in proteomes.

The majority of proteins consist of multiple domains that are either repeated or combined in defined order. In this study, we survey the combination of protein domains defined at fold and fold superfamily levels in 185 genomes belonging to organisms that have been fully sequenced and introduce a method that reconstructs rooted phylogenomic trees from the content and arrangement of domains in pr...

متن کامل

Discovering Domains Mediating Protein Interactions

Background: Protein-protein interactions do not provide any direct information re‌garding the domains within the proteins that mediate the interactions. The majority of proteins are multi domain proteins and the interaction between them is often defined by the pairs of their domains. Most of the former studies focus only on interacting do‌main pairs. However they do not consider the in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proceedings of the National Academy of Sciences of the United States of America

دوره 102 2  شماره 

صفحات  -

تاریخ انتشار 2005